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Bounding the Greedy Strategy in Finite-Horizon String Optimization 

Yajing Liu, Edwin K. P. Chong, and Ali Pezeshki 


Abstract — We consider an optimization problem where the 
decision variable is a string of bounded length. For some time 
there has been an interest in bounding the performance of the 
greedy strategy for this problem. Here, we provide weakened 
sufficient conditions for the greedy strategy to be bounded by 
a factor of (1 — (1 — 1 / K) 1: j, where K is the optimization 
horizon length. Specifically, we introduce the notions of K- 
submodularity and A'-GO-concavity, which together are suffi¬ 
cient for this bound to hold. By introducing a notion of curvature 
rj e (0,1], we prove an even tighter bound with the factor 
(l/r?)(l — e~ v ). Finally, we illustrate the strength of our results 
by considering two example applications. We show that our 
results provide weaker conditions on parameter values in these 
applications than in previous results. 

I. Introduction 

In a great number of problems in engineering and applied 
science, we are faced with optimally choosing a string (finite 
sequence) of actions over a finite horizon to maximize an 
objective function. The problem arises in sequential decision 
making in engineering, economics, management science, and 
medicine. To formulate the problem precisely, let A be a set 
of possible actions. At each stage i, we choose an action 
a,i from A. Let A = (oi, <22,.. •, a*,) denote a string of 
actions taken over k consecutive stages, where ^ £ A for 
i = 1,2,..., k. Let A* denote the set of all possible strings 
of actions (of arbitrary length, including the empty string 
0). Finally, let / : A* — > R be an objective function, where 
R denotes the real numbers. Our goal is to find a string 
Ms A*, with a length |M| not larger than K (prespecified), 
to maximize the objective function: 

maximize /(M) 

subject to Mg A*, |M| < K. (1) 

The solution to ([]}. which we call the optimal strategy, is 
hard to compute in general. One approach is to use dynamic 
programming via Bellman’s principle (see, e.g., [1] and [2]). 
However, the computational complexity of this approach 
grows exponentially with the size of A and the horizon length 
K. On the other hand, the greedy strategy, though suboptimal 
in general, is easy to compute because at each stage, we only 
have to find an action to maximize the step-wise gain in the 
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objective function. But how does the greedy strategy compare 
with the optimal strategy in terms of the objective function? 

The above question has attracted widespread interest, with 
some key results in the context of string-submodularity 
(see, e.g., [3], [4], [5], and [6]). These papers extend the 
celebrated results of Nemhauser et al. [7], [8], and some 
further extensions of them (see, e.g., [9], [10], [11], and 
[12]), on bounding the performance of greedy strategies in 
maximizing submodular functions over sets, to problem 0 
that involves maximizing an objective function over strings. 
In particular, Streeter and Golovin [3] show that if, in 0, 
the objective function / is prefix and postfix monotone 
and has the diminishing-return property, then the greedy 
strategy achieves at least a (1 — e _1 ^approximation of the 
optimal strategy. Zhang et al. [5], [6] consider a weaker 
notion of the postfix monotoneity and provide sufficient 
conditions for the greedy strategy to achieve a factor of at 
least (1 — (1— 1/K) k ), where K is the optimization horizon 
length, of the optimal objective value. They also introduce 
several notions of curvature, with which the performance 
bound for the greedy strategy can be further sharpened. 

But all the sufficient conditions obtained so far involve 
strings of length greater than K, even though 0 involves 
only strings up to length K. This motivates a weakening of 
these sufficient conditions to involve only strings of length 
at most K, but still preserving the bounds here. 

In this paper, we introduce the notions of A'-submodularity 
and /f-GO-concavity, which together are sufficient for the 
(1 — (1 — \/K) k ) bound to hold. By introducing a notion 
of curvature p £ (0,1], we prove an even tighter bound with 
the factor (1/77)(1 — e~ v ). Finally, we illustrate the strength 
of our results by considering two example applications. We 
show that our results provide weaker conditions on parameter 
values in these applications than in previous results reported 
in [5] and [6]. 

II. Review of Related Work 

In this section, we first introduce some definitions related 
to strings and curvature. We then review the main results 
from [5] and [6]. Specifically, the results there provide 
sufficient conditions on the objective function / in 0 such 
that the greedy strategy achieves a (1 — (1 — 1/K j^j-bound. 

A. Strings and Cun’ature 

For a given string A = (01,02 ,..., a*,), we define its 
length as k, denoted |A| = k. If M = (0™, a™,..., a£() 
and N = (a", 0 %,..., a£ 2 ) are two strings in A*, we 
write M = N if |M| = \N\ and a™ = a™ for each 
i = 1,2,..., |M|. Moreover, we define string concatenation 


as M©TV = (a ™, a™,, aj™, a", , - • ■, a£ 2 ) • If M and 

TV are two strings in A*, we write MAN if we have 
TV = M© L for some L £ A*. In this case, we also say that 
M is a prefix of TV. 

A function from strings to real numbers, / : A* —> R, is 
string submodular if 

i. f has the prefix-monotone property: VM,TV £ A*, 
f(M © TV) > f(M). 

ii. / has the diminishing-return property: VM A TV £ 
A*,Va £ A, /(M®(a))-/(M) > /(TV®(a))-/(TV). 

A function from strings to real numbers, / : A* —>■ R, is 
postfix monotone if 

VM, TV £ A*, /(M © TV) > /(TV). 

The total backward curvature of f is defined as 

a = max 
aeA.Me A* 

f (/((«)) - /(0)) - (/((«) 0 M) - /(M)) ) 

l /((«)) - /(0) /' 

A. Bounds for the Greedy Strategy 

We now define optimal and greedy strategies for problem 
0 and some related notation. 

(1) Optimal strategy: Any solution to 0 is called an 
optimal strategy. If / is prefix monotone, then there 
exists an optimal strategy with length K, denoted Ok = 
(oi,...,Oif). Let Oi = (oi,...,Oj) for i = 1,... ,K. 

(2) Greedy strategy: A string G k = (gi, g 2 , • ■ ■, g k ) is 
called a greedy strategy if Vi = 1, 2, ..., k, 

Qi S arginax/((sfi,£ 2 ,--.,ffi-i,3))- 

36 A 

Let Gi = {gi ,..., gf) for i = 1,..., AT. 

The following two theorems summarize the performance 
bounds in [5] and [6]. 

Theorem 1: If / is string submodular and /(G, ® Ok) > 
/( Ok ) holds for all i = 1 ..... A' 1, then any greedy 
strategy Gk satisfies 

f(Gx) > (l - (l - ^) ) f(Ox) > (1 - e-^fiOK)- 

Theorem 2: If / is string submodular and postfix mono¬ 
tone, then any greedy strategy Gk satisfies 

/<o K )>I(i-(i-J)“) no,,) 

>1(1 -e~°)f(0 K ) 

>(l-e- 1 )f(0 K ). 

Under additional assumptions on the curvature a of /, [5] 
and [6] provide even tighter bounds. Notice that the sufficient 
conditions above involve strings of length greater than K, 
even though the problem 0 involves only strings up to 
length K. This motivates a weakening of these sufficient 
conditions to involve only strings of length at most K, but 
still preserving the bounds here. In the next section, we 
present our main results along these lines. In Section Hvl we 


show that these weakened sufficient conditions also lead to 
weaker requirements than in [5] and [ 6 ] for two application 
examples. 

III. Main Results 

Before stating our main results, we first introduce some 
definitions on / : A* —> R. 

i / is K-monotone if VM, TV £ A*, and |M| + |TV| < K, 
f (M © TV) > /(M). 

ii. / is K-diminishing if VM A TV £ A* and |TV| < AT—1, 
Va £ A, /(M © (a)) - /(M) > /(TV © (a)) - /(TV). 

iii. / is K-submodular if it is both AT-monotone and AT- 
diminishing. 

iv. Let Gi = ( 51 ,..., < 7 i) (as before) and Ok-i = 

(oj + i,..., Ok) for i = 1,..., AT. Then, / is K-GO- 
concave if for 1 < i < K — 1, 

f(Gi © 0 K -i) > ^f(Gi ) + (i - £) f(0 K ). 

Notice that these definitions involve only strings of length 
at most K. Moreover, it is clear that if / is string sub¬ 
modular, prefix monotone, and has the diminishing-return 
property, then / is string AT-submodular, AT-monotone, and 
AT-diminishing. Under these weaker conditions, we show that 
the previous bounds on the greedy strategy still hold. 

Theorem 3: If / is AT-submodular and AT-GO-concave, 
then 

f(Gx ) > ^1 - (l - 1) ^ f(0 K ) > (1 - e _ 1 )/(Oif). 

Proof: Because / is A'-diminishing, we have that for 

1 < i < AT, 

f{oi)>f(Oi)-f(Oi- 1 ). 

By definition of the greedy strategy, for 1 < i < AT, 

f{Gi) > f(oi) > f{Oi) — f(Oi-i). 

Summing the inequality above over i from 1 to K produces 

K K 

E/(Gi)>E(/(°>)-/(°-1)) 

2=1 2=1 

=► Kf(G i) > f(0 K ) 

=> f(Gi) > ±f(0 K )■ 

For 1 < i < K — 1, because / is A'-diminishing, we have 

/(Gi ® o K ) ~ f(Gi) 

> f{Gi © (o»+i, • • • , ok)) — f(Gi © (oi+i, • • • , ok- i)), 
f(Gi®o K -i)- /(Gi) 

> f{Gi © (oj_|-i, • • • , ok-i)) — f(Gi © (oj+i, • • • , 0 K- 2 )), 

/(GiffiOi+a)- /(Gi) 

> /(Gj © (oj+i, 0 ^+ 2 )) — /(Gj © (o i+ i)), 

/(Gi © o i+1 ) - /(Gi) > /(Gi © (oi +1 )) - /(Gi)). 



Summing the inequalities above, we have 

K 

J2 (/( G ‘ ® °i) - /( G 0) ^ /( G ‘ ® °K-i) - 

i=»+i 

By definition of the greedy strategy, we have for i +1 < j < 

K. 

f(G i+1 ) - f{Gi) > f(G t © 0j ) - f(Gi), 

which implies that 

K 

{K - *)(/(G <+1 ) - /(GO) > E (/( G ‘ ® °i) - /( G 0)- 

j=*+i 

Hence, we have for 1 < i < K — 1, 

/(Gj+r) - /(GO > —^—:(/(Gi © Ox-i) - /(GO). (2) 

By i^-GO-concavity, for 1 < i < K — lwe have 

/(Gi+i) - /(GO 

> 77 ^—(/(Gi © Ox-*) - /(GO) 

K — z 

= ^(f(0 K )-f(G i )), 

from which we get 

/(G i+ 1 )>^/(0*) + (l--i) /(GO- 

Therefore, 

/(Gx) > -^/(Ox) + (l - -0 /(Gk-i) 

= ( 1 -( 1 -i)E (OK> ' 

Because 1 — (l — -2) \ 1 — e _1 as AT —>■ oo, we also 

have 

/(Gk) > ^1 - (l - -^) ^ /(Ox) > (1 - e-0/(Ox). 

■ 

Next, we introduce a new notion of curvature 77 as follows: 

77 = max 
l<z<iC—1 

f Kf{Gj) - (Kf(Gj © 0 K -i) - (K - i)f{Q K )) ) 

\ (K - i)/(G0 J • 

If / is K -GO-concave, then for 1 < i < K — 1 we have 

Kf(Gi)—(Kf(Gi © Ox-0 - {K - *)/(Ox)) 

< Kf(Gi) - if(Gi) 

= {K — i)f(Gi), 


which implies that 77 < 1. The following theorem gives a 
bound related to the curvature //. 

Theorem 4: If / is AT-submodular and /f-GO-concave, 
then 

/ (Gk) > i (i - (i -10)/ ( 0k) 

> -(l- e -”)/(Ox). 

n 

Proof: By definition of the curvature ?/, we have 

f{G% © Ox-0 - /(GO > —jr-{f(0 K ) - r,f(Gi)). 

By definition of the greedy strategy and inequality Q. 
we have 

f(Gi+i) - /(GO > ^—(f(Gi © Ox-0 - f{Gi)) 

K — z 

> —: ‘ —(/(Ox) — t?/( G 0) 

A — z A 

= -f(/(Ox)-t?/(G0), 

from which we get 

/(Gi+i) > ^/(Ox) + (l - -|) /(GO- 

Therefore, 

/«?*) > 2,/(0 K ) + (l - £) /(&-,) 

=H 1 - ( x - *r) n ° K) - 

Because 2 ^1 — (l — -2,) A ^ \ 2(1 — e ~ v ) as iT —» oo, we 
also have 

/( G ^)^^( 1 -( 1 -|) )/(°^) 

> -(l-e-”)/(Ox). 

1 

■ 

Remarks 

• The term 2(1 — e~ v ) is decreasing in ?/ £ (0,1]. 

« When 77 = 1, 2(1 — e~ v ) = 1 — e _1 , which is the 
bound in Theorem 0 Moreover, for 0 c 77 c i, i(i 
e~ v ) > 1 — e -1 . Hence, Theorem[4]is a generalization 
of Theorem [3] and gives a tighter bound. 

« When 77 —> 0, we have 2(1 — e~ v ) —>• 1, making the 
greedy strategy asymptotically optimal. 

IV. Applications 

In this section, we consider two example applications, 
namely task assignment and adaptive measurement design, to 
illustrate the strength of our results. In each case, we derive 
sufficient conditions, on the parameter values of the problem. 











for the greedy strategy to achieve the (1 — (1 — 1/K) k ) Thus, a sufficient condition for / to be I\ -diminishing is 
bound. These sufficient conditions are weaker than those we 

previously reported in [5]. L > (1 - L)U. (4) 


A. Task Assignment Problem 

As our first example application, we consider the task 
assignment problem that was posed in [3] and was further 
analyzed in [5], In this problem, we have n subtasks and a set 
A of A' agents. At each stage, we get to assign a subtask to 
an agent, who accomplishes the task with some probability. 
Let pf (a) denote the probability of accomplishing subtask 
i at stage j when it is assigned to agent a £ A. Assume 
that pi (a) £ [Li(a),Ui(a)}, 0 < A* (a) < Ui{a) < 
1 , and that the limits of the interval are independent of 
the stage in which subtask i is assigned to agent a. Let 
X,{ai. a ,2 ■ ■ ■ ■, o-k ) denote the random variable that describes 
whether or not subtask i has been accomplished after the 
sequence of assignments (an, <22, ..., a k ) over k steps. Then, 
— Xi(di, a- 2 , • ■ •, (tfe) is the fraction of subtasks ac¬ 
complished by employing agents (ai, ci2> • • •, a k ) over k 
steps. The objective function / for this problem is the 
expected value of this fraction, which can be written as 

-t n / k 

f((ai,...,a k )) = 1 1 — IT ( 1_ ^'( a J')) 

i=i \ j =1 

We wish to derive sufficient conditions on the set of pa¬ 
rameters {(L(a),U(a)) |a £ A} so that / is A'-monotone, 
A'-diminishing, and A'-GO-concave. 

For simplicity, we consider the case of n = 1. But our 
results can easily be generalized to the case where n > 1. 
For n = 1, the objective function / reduces to 

k 

f((a 1 ,...,a k )) = 1 - Ila-riC-i)), (3) 
j=i 

and from here on we simply use p?(a,j) in place of p[(dj). 

It is easy to check that / is AT-monotone. For / to be 
A'-diminishing, it suffices to have 

f(M © (a)) - f(M) > f(M © ( b ) © (a)) - f(M © (6)), 

for any a, b £ A and for any M £ A* with \M\ < K — 2. 
Let M = (ai,..., a m ), then we have 

p m+ \a) > (1 -p m+1 {b))p m+2 {a). 



Now, let us rearrange the AT-GO-concavity condition as 


(K-i)(f(O K )-f(G i ®0 K _ i )) 

<i(f{Gi@0 K -i)-f(Gi)). 


Replacing for / from (0 gives (after simplifying) 


K 


< i 


{K -i) (1 -p j (o i )) 

j=i+l 
K 

1 n a ~p>( 0i )) 


1 - 


n-=i(i-p>o) 
n;=i(! -pk 9 j)) 


j=i+l 


Because /(Ok) > f(Gi © Ok-i), we have 

<1 

Therefore, to have K- GO-concavity it suffices to have 


K 

(K -i) (1 -p>(oi)) <i 

j=i +1 


K 

1 ~ n (! -p 3 ^)) 

j=i +1 


or equivalently 

K 

n (! - A 3 (o*)) < (5) 

j=i +1 


for 1 < * < AT— 1. If we assume that p l (oi) > 4 for 2 < i < 
K, then it is easy to see that 0 holds for 1 < i < K — 1. 
Thus, a sufficient condition for A'-GO-concavity is 

L>\. ( 6 ) 

If 0 holds then 0 also holds. Thus, 0 is sufficient for the 
greedy strategy to achieve the (1 — (1 — bound. 

Remark 1: The sufficient condition in [5] requires 0 and 


Suppose that L(a) < pi (a) < U(a ) for all a £ A, j = 


1,2,..., AT. Let 


and 


Then, we can write 


U = max U(a) 
aG A 


L = min L(a). 

aeA 


p m+ \a) > L(a) > L 


and 

(1 - p m+ \b))p m+2 (a) < (1 - L(b))U(a) < (1 - L)U. 


p\gi) > l-c K , (7) 


where 


c = 


1 - U(a 
1 - L(a) 


When all pl(cij) > 1/2, then 0 and 0 automatically hold, 
but 0 is not necessarily satisfied. In that sense, the K- 
monotone, A'-diminishing, and AT-Go concavity conditions 
are weaker sufficient conditions for achieving the (1 — (1 — 
-L j K ) bound than the prefix monotone, diminishing-return, 
and postfix monotone conditions of [5], 











B. Adaptive Measurement Problem 

As our second example application, we consider the 
adaptive measurement design problem posed in [13] and 
[5], Consider a signal of interest x £ R 2 with normal prior 
distribution Af( 0, 7), where I is the 2 x 2 identity matrix; 
our analysis easily generalizes to dimensions larger than 2 . 
Let A = {Diag (y/e, y/1 — e) : e £ [0.5,1]}. At each stage 
i, we make a measurement of the form 

Ui = AiX + Wi, 


where Ai € A and Wi is a Gaussian measurement noise 
vector with mean zero and covariance of 7. 

The objective is to choose a string of measurement ma¬ 
trices { A, }* =1 with k < K to maximize the information 
gain: 

/((ai,... ,a k )) = Hq- H k . 

Here H 0 = -rj- log(27T“r ) is the entropy of the prior distribution 
of x and ///, is the entropy of the posterior distribution of x 
given {yi }\ : =1 ; that is, 

1 , x N 

Hk = ^log det(Pfc) + —log(27re), 

where 

p k = ( P fc -1 + ^ k A * Ak ) 

is the posterior covariance of x given {y ,;}* =1 [13], 

We wish to derive sufficient conditions on the set of pa¬ 
rameters {crf}^L 1 so that / is 77-monotone, A'-diminishing, 
and A’-GO-concave. It is easy to see that / is 7f-monotone 
by form, and it is A'-diminishing if { of}fL , is a non¬ 
decreasing sequence, that is, 

vf+i>Vi, for i = 1,2, ...,77- 1. ( 8 ) 

Let Af = Diag(y / e7, y/1 - e/) and A* = 
Diag(y'e*, y/1 — e*) be the greedy and optimal actions at 
stage i, respectively; that is, gi = Af and o; = A*. Then, 
the A'-GO-concavity condition for this problem is that for 
1 < * < K — 1 , we must have 

(s* + sZ-^-Hck (S* + SK_i)) K ~ i sl{ ai - s z y 

< (Si + ^K—i) K {c K - (Si + S* k J)) k , (9) 


where 


•sf=i+E^*. 

7=1 i 


K 


1 


S K~i E a 2 e P 

j=i +1 1 

* 1 

= 1 + E]T2 e 7’ 
7=1 7 

i 1 

ai =2 +, 


j=i J 


c K =2 +yy 


i 

j=i ° 


Because /(Ok) > /(Gj ® Ox-i), we have 

(# + &-<)(<*-(S?+£*-<)) 

> (Si + S* K _i)(cK — (Si + Sx—i))- 

It is easy to check that 

SMi - Si) < (Si + S* K _ t )(c K - (Si + S* K _ t )). 

Therefore, we have 

Si( ai - Si) < (S* + S* K _i)(c K - (S* + S* K _i)). (10) 

Let 


g(i) = (S* + S* K _ i ) K ^(c K -(S* + S* K _ i )) K ^Sl(ai-S i y. 


Then, 


g(i + 1 ) _ Si(cti — Si) 

g (i) ~ (st + s* K _y)(cK - (st + §*«_,))■ 

By ( [Tol l. g(i) is non-increasing. Hence, it suffices to have 


(Si* + S^f-^CK - (S* + s^y^s^a, - Si) 
<(Si+S* K -x) K (ck~(S 1 +S* k _ 1 )) k (11) 


in order to get A'-GO-concavity. 

Let T-i = ai - S u T{ = 0l - Sf, and 

7=2 3 

Then, we can rewrite ( ITT1> as 


(S* + S 


* 

K -1 


) k -\T/ +T^_ 1 )) k ~ 1 S 1 T 1 
< (Si + &k-i) K (T\ + Tk_ x 


) K - 


( 12 ) 


If 


(si + s^yn + T^)) 

= (Si+S* K _ 1 )(T 1 +T* K _ 1 )), (13) 

that is, to have f(Oi) = f(Gi), then (fl 2 T> always holds, 
because SiTi < (Si + S^^Ti + T£_i)). 

We now show that ([T3l > always holds, given the action set 
A considered in this example. In other words, the 77-GO- 
concavity condition is satisfied and this means that ([ 8 ]) is a 
sufficient condition for achieving the (1 — (1 — j^) K ) bound. 

By definition of the greedy strategy, we have f(G i) > 
f(0 1 ), which means 

( 1 + _L gl ) (l + ^l-er)) 

-( i+ ^ e 0 ( i+ ^ (i ^ eD )- 

Simplifying the above inequality gives 


(ei ~ el)(l - (ei + ej)) > 0. 
Because /(Ok) > f(G\ ® O k - 1 ), we have 


K 


i + ^ + yy^. 


fi 

7=2 °7 


i + ilzyil + y^ 1 e * j) 


*1 


7=2 °7 


> 


K 

ei ^ e. 






, =2 




7—2 "j 


( 14 ) 












which implies that 


(ei ~el) 


K 




j—2 


> — (ei - e*)(l - (ei + e*)). 


(15) 


The inequality (IT4l) implies that ei < e\. From (IT5] > and 
<®, we have that 


(ei -e\) 



1) 


> 0 , 


which implies that e\ = e\. Since if e\ ^ e*, then ei < e^, 
which implies that 


(ei - e^) 



1) 


< 0 , 


while 

-4( e i- e i)(l-( e i + e i)) >°: 

a i 


which contradicts ( f]~5l >. Hence, we have e\ = e^, which 
means G i = Or, and the inequality (1 1 3| i holds. 

Remark 2: The sufficient condition in [5] for achieving 
the (1 — (1 — k) K ) bound in this problem requires both dU* 
and 
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where [a, b\ is the interval that contains all the a, s. Therefore, 
the condition derived in this paper is a weaker sufficient 
condition than that obtained in [5], 


V. Conclusion 

We considered an optimization problem CD where the 
decision variable is a string of length at most K. For this 
problem, we reviewed some previous results on bounding 
the greedy strategy. In particular, the results of [5] and 
[6] provide sufficient conditions for the greedy strategy to 
be bounded by a factor of (1 — (1 — 1/K) k ). We then 
presented weakened sufficient conditions for this same bound 
to hold, by introducing the notions of A% submodularity and 
A'-GO-concavity. Next, we introduced a notion of curvature 
it £ (0,1], which furnishes an even tighter bound with the 
factor -(1 — e _I? ). Finally, we illustrated our results by 
considering two example applications. We showed that our 
new results provide weaker conditions on parameter values 
in these applications than in [5] and [6]. 
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